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ABSTRACT Through the use of genetic, enzymatic, metabolomic, and structural analyses, we have discovered the cataboUc path- 
way for proline betaine, an osmoprotectant, in Paracoccus denitrificans and Rhodobacter sphaeroides. Genetic and enzymatic 
analyses showed that several of the key enzymes of the hydroxyproline betaine degradation pathway also function in proline be- 
taine degradation. Metabolomic analyses detected each of the metabolic intermediates of the pathway. The proline betaine cata- 
bolic pathway was repressed by osmotic stress and cold stress, and a regulatory transcription factor was identified. We also re- 
port crystal structure complexes of the P. denitrificans HpbD hydroxyproline betaine epimerase/proline betaine racemase with 
L-proline betaine and cis-hydroxyproline betaine. 

IMPORTANCE At least half of the extant protein annotations are incorrect, and the errors propagate as the number of genome 
sequences increases exponentially. A large-scale, multidiscipUnary sequence- and structure-based strategy for functional assign- 
ment of bacterial enzymes of unknown function has demonstrated the pathway for catabolism of the osmoprotectant proline 
betaine. 
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Most bacterial habitats are dynamic. The survival and prolif- 
eration of bacteria depend upon their ability to adapt to 
changes in the surrounding environment in such categories as 
nutrient availability, pH, temperature, and solute concentrations 
in order to persist and flourish. Hence, bacteria have evolved di- 
verse pathways to overcome unfavorable environmental changes 
in their microenvironment. Changes in osmolarity are among the 
more common stresses bacteria encounter, and they respond by 
accumulating high concentrations of inorganic ions (K+), amino 
acids (proline, glutamate), carbohydrates (trehalose, sucrose, 
glycerol), and/or methylated compounds (glycine betaine, proline 
betaine [Pro-B], hydroxyproline betaine [Hyp-B], dimethylsulfo- 
niopropionate) (1, 2), often referred to as compatible solutes. Ac- 
cumulation of compatible solutes has also been reported to coun- 
ter other environmntal stresses such as cold stress (3, 4). 

Betaines are quaternary amine derivatives of amino acids that 
often serve as osmoprotectants (Pro-B is N,N-dimethylproline 
and is also called stachydrine). Plants and marine algae are the 
major producers of Pro-B and are known to accumulate high con- 
centrations of betaines under stress conditions (5-7). Pro-B is 
excreted from germinating seeds and plant roots into bacterial 
habitats, and thus, bacteria growing in close association with 
plants have evolved pathways to utilize betaines for two distinct 
purposes: as osmoprotectants under high-osmolarity conditions 



and as carbon and nitrogen sources in the absence of osmotic 
stress (8-10). Reasonable pathways for Pro-B catabolism have 
been proposed (9, 1 1, 12) but have remained without biochemical 
support. 

We previously reported a pathway prediction approach based 
on genome context plus in silico docking of putative substrates in 
modeled and determined protein structures (13). Put simply, the 
substrate or product of one enzyme may be the product or sub- 
strate of the prior or next enzyme of the pathway. If so, the sub- 
strates and products of genome-context-related enzymes should 
be structurally similar. Consistent results obtained by metabolite 
docking of several genome-neighborhood-related enzymes allow 
more-reliable enzyme function predictions. Here, we have built 
upon this work to delineate the Pro-B catabolic pathway of the soil 
bacterium Paracoccus denitrificans and the aquatic photosynthetic 
bacterium Rhodobacter sphaeroides. Regulation of transcription of 
the Pro-B catabolic genes triggered by changes in osmotic stress 
and temperature is reported. 

RESULTS AND DISCUSSION 

The salient clues to the tHyp-B (fraMS-4-hydroxy-L-proline be- 
taine) degradation pathway determined earlier (13) were the re- 
sults of in silico docking of a metabolite library in the active site of 
previously determined Pelagibaca bermudensis HpbD structure 
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Pelagibaca bermudensis HTCC2601 

hpbA hpbBl hpbCl hpbD hpbH hpbl hpbJ hpbK hpbL hpbC2 hpbB2 hpbR putA 
^^M " 1623 1628 4863 4943 " 18653 18658 18663 "^^^ 

Paracoccus den itrif icons PD1222 

hpbK hpbL putA hpbM hpbD hpbCl hpbBl hpbR hpbX hpbY hpbZ hpbA hpbC2 hpbB2 
0015 0016 '"^^^""^■^^l 1188 1189 1190 2831 2832 

78% 84% 64% 64% 61% 59% 59% 83% 62% 58% 

hpbN hpbO hpbH hpbJ 



58% 22% 



Rhodobacter sphaeroides 2.4. 1 



putA hpbK hpbL hpbA hpbR hpbBl hpbCl hpbJ hpbl hpbH hpbD 
^^••■2575 2576 ^977 3978 3979 

66% 70% 81% 78% 36% 57% 62% 64% 76% 69% 63% 

FIG 1 Genome contexts of HpbD in P. bermudensis and the orthologous genes in P. denitrificans and R. sphaeroides. The genes encoding orttiologues are given 
tlie same color, and the sequence identities relating orthologous genes are indicated. More detail is given in Table Si in the supplemental material. Genes 
separated by a string oi^ dots are unlinked in the genome. Note that in vivo study of P. bermudensis was precluded by its slow growth and very low maximal cell 
density and by a lack of genetic tools. 



plus similar docking of homology models of two proteins, P. ber- 
mudensis HpbJ (pfoHpbJ) andpfoHpbBl, encoded by neighboring 
genes. These investigations led to the prediction that the sub- 
strates of HpbD were the trans and cis isomers of Hyp-B and/or the 
L and D forms of Pro-B. HpbD would catalyze the inversion of 
chirality at Ca and thus perform epimerization on the hydroxy- 
proline derivatives and racemization of the proline derivatives. 
Indeed, HpbD was found to have activity with both frans- Hyp-B 
and L-Pro-B (13). A P. denzYri/icflns protein having sequence (64% 
identity) and genome context similarities to P. bermudensis HpbD 
(Fig. 1) was also shown to have both activities and thus was named 
pdHpbD. Modeling of pdHphD based on the P. bermudensis 
HpbD crystal structure gave a satisfying model into which Pro-B 
was readily docked. We have now buttressed this finding by deter- 
mination of the 2.0-A crystal structure of unliganded pdYbphD 
and the 1.6-A and 1.7-A structures of the complexes oipdWiphD 
with L-Pro-B and with cis-Hyp-B, respectively (see Fig. SI and S2 
and Table SI in the supplemental material). The modeled and 
determined structures of the L-Pro-B complex withpdHpbD agree 
very well (Fig. 2). A protein from R. sphaeroides 2.4.1 encoded in a 
similar genome context (Fig. 1) was also predicted to contain the 
same binding site residues (Fig. 2) and thus seemed very likely to 
catalyze the Pro-B racemase reaction. This was confirmed exper- 
imentally (Table 1; see also Fig. S3 in the supplemental material), 
and the protein was named rsHpbD. 

P. denitrificans utilizes l- and D-Pro-B as carbon and nitro- 
gen sources at low salt concentrations and as osmoprotectants 
at high salt concentrations. When grown in minimal media lack- 



ing a carbon and/or nitrogen source, P. denitrificans and _R. spha- 
eroides readily utilized either the l- or D-enantiomers of Pro-B as 
sole sources of carbon and nitrogen (Fig. 3A). The osmoprotective 
effects of L- and D-Pro-B were tested under aerobic conditions in 
minimal medium containing glucose as the carbon source and 
growth-inhibitory concentrations of NaCl. The l and d forms of 
Pro-B act as osmoprotectants, although D-Pro-B was the some- 
what more efficient osmolyte (Table 2 and Fig. 3B and C), perhaps 
because it was metabolized more slowly. 

Prediction and genetic testing of the Pro-B degradation 
pathway. We had earlier shown that the proteins and enzymes 
encoded in the genome neighborhood of HpbD in P. bermudensis 
constitute a catabolic pathway for tHyp-B (13). Based on the re- 
actions catalyzed by members of the families and superfamilies of 
the proteins encoded within the hpbD genome neighborhood 
(Fig. 1; see also Table S2 in the supplemental material), we pre- 
dicted a catabolic pathway that degrades l- or D-Pro-B to glutamic 
acid (Fig. 4). In the case of D-Pro-B, HpbD would catalyze racem- 
ization to L-Pro-B. The L-Pro-B would be demethylated to 
N-methyl-L-proline by one or both of the Rieske-type proteins 
encoded by HpbBl-HpbCl and HpbB2-HpbC2, and the 
N-methyl-L-proline would be demethylated to L-proline by 
HpbA, a flavin-dependent enzyme. The pathway would conclude 
by the action of PutA, a proline dehydrogenase-l-pyrroline-5- 
carboxylate dehydrogenase. PutA is a bifunctional enzyme that 
catalyzes two consecutive reactions that oxidize L-proline to 
L-glutamic acid. Deamination of glutamic acid would give 
2-ketoglutarate, the tricarboxylic acid cycle intermediate, and 
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FIG 2 (A) Comparison of two homology models, pdHpbD (magenta) and rsHpbD (orange), with the X-ray structure ofpdHpdD (cyan; PDB 4110). (B) The 
Glide XP docking positions of proline betaine in the two models are highly similar to that of the cocrystallized proline betaine of the X-ray structure. Asp 194, 
Glu219, and Asp242 are metal (shown in spheres) binding residues. Lysl64 and Lys266 are two catalytic bases, whereas Trp312 and Asp293 are important 
substrate binding residues. 



thereby allow Pro-B to support growth as the sole source of both 
nitrogen and carbon (Fig. 4). 

Another possibility was that the two demethylases could con- 
vert D-Pro-B to D-proline, which would be converted to L-proline 
by a proline racemase and thereby allow D-Pro-B to be used as a 
carbon and nitrogen source. However, this pathway was rendered 
unlikely by the finding that neither P. denitrificans nor R. spha- 
eroides can grow on D-proline as the sole carbon source (Table 2), 
although both bacteria utilize D-proline as an osmoprotectant, 
indicating that D-proline is transported into the cytosol. Hence, 
these bacteria are probably unable to utilize D-Pro-B due to lack of 
a proline racemase. This is surprising because it has been reported 
that 42% of the total free proline in P. denitrificans is the D-isomer 
(14). 

To test the hypothesized pathway, strains lacking the racemase 
were assayed for the ability to grow on the l and d forms of Pro-B 
as the sole carbon source (Fig. 5A). hpbD strain RPdl lacking the 
racemase grew well on L-Pro-B but failed to grow on D-Pro-B, 
whereas the wild-type strain and the hpbD strain carrying a plas- 
mid with hpbD grew well on both isomers. Identical results were 
seen for the hpdD strain of R. sphaeroides (Table 3). 

It was recently demonstrated that the Sinorhizobium meliloti 



TABLE 1 Kinetic constants of HpbD enzymes of R. sphaeroides and P. 
denitrificans 



Enzyme and 








Source or 


compound 




K„ (mM) 


(M-i s-i) 


reference 


rsHpbD 










L-Pro-B 


74 ± 30 


440 ± 300 


170 


This study 


D-Pro-B 


4.3 ± 0.6 


11 ± 4 


390 


This study 


tHyp-B 


70 ± 7 


16 ± 7 


4,300 


This study 


cHyp-B 


58 ± 6 


190 ± 30 


300 


This study 


pdHpbD 










L-Pro-B 


67 ± 10 


33 ± 9 


2,000 


11 


D-Pro-B 


28 ± 6 


7.3 ± 3 


3,800 


This study 



Rieske protein demethylates L-Pro-B (15). Wild-type growth of 
strain RPdl on L-Pro-B suggested that at least one of the two P. 
denitrificans Rieske-type proteins (encoded by hpbBl and hpbCl 
or hpbB2 and hpbC2) was responsible for the first demethylation 
of L-Pro-B. The genes encoding both sets of Rieske-type proteins 
were separately disrupted to give strains that grew slowly with l- or 
D-Pro-B as the sole carbon source but had wild-type growth rates 
with L-proline as the carbon source, suggesting that the two gene 
sets had redundant functions (Table 2). Hence, a strain was con- 
structed that had disruptions of the genes encoding both Rieske- 
type proteins. This strain failed to grow on l- or D-Pro-B but 
showed wild-type grovrth with N-methyl-L-proline, the product 
of the first demethylation (Fig. 5B and Table 2). Growth on l- or 
D-Pro-B was restored upon complementation of the double- 
mutant strain with hpbBl and hpbCl expressed on a plasmid from 
its native promoter (Fig. 5B). Thus, both Rieske-type proteins are 
demethylases active on L-Pro-B. 

The gene encoding the N-methyl L-proline demethylase 
(hpbA) predicted to remove the second Pro-B methyl group was 
disrupted to give a strain that failed to grow on N-methyl-L- 
proline or on either Pro-B isomer (Fig. 5C and Table 2). Growth 
was restored upon complementation with a plasmid expressing 
the hpbA gene from its native promoter (Fig. 5C). 

Metabolomics confirms the predicted Pro-B catabolic path- 
way. The pathway for D-Pro-B degradation (Fig. 4) predicts that 
Pro-B, N-methyl-Pro, proline, and glutamate should be observed 
in metabolite extracts of wild-type P. denitrificans grown with 
D-Pro-B as the sole carbon source but not in cells from cultures 
grown with glucose as the carbon source. Moreover, extracts of a 
strain fed D-Pro-B that lacked HpbD and both sets of Rieske-type 
proteins (strain RPdl4) should contain only Pro-B and the amino 
acids normally present, proline and glutamate. (In the feeding 
experiments, carbon-starved wild-type cells were briefly fed d- 
Pro-B; see reference 11.) Extracts of these strains were analyzed by 
liquid chromatography-Fourier transform mass spectrometry 
(LC-FTMS). The liquid chromatographic retention times of me- 
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Time (h) 

FIG 3 Effects of L- or D-Pro-B on growth of wild-type P. denitrificans as the 
carbon source and as an osmoprotectant. (A) Growth without added NaCI. 
Growth on glucose is also shown. (B) Growth in the presence or absence of 0.5 
M NaCl with or without L- or D-Pro-B supplementation. The L- or D-isomers 
of ProB were used at a concentration of 20 mlvl. (C) Effects of addition of the 
L- or D-isomers of ProB on growth of wild-type R. sphaeroides in minimal 
media containing 0.5 M NaCl. Glucose (20 mlVI) was present in all cultures. 



tabolites from cell extracts were confirmed via accurate mass mea- 
surements ( ± 5 ppm) obtained with the mass spectrometer, and 
the peaks matched the elution times of Pro-B, proline, and gluta- 
mate standards separated by several liquid chromatographic pro- 
tocols. In extracts of cells grown on D-Pro-B (the wild-type strain) 
or fed D-Pro-B (mutant strain RPdl4), similar Pro-B levels were 
detected (Fig. 6; see also S5 in the supplemental material). How- 
ever, only the wild-type cells produced N-methylproline, the 
product of the first demethylation. The third metabolite in the 
pathway, proline, was detectable in all samples, but the wild-type 
cells accumulated nearly 400-fold-more proline than the mutant 



strain (Fig. 6E). As expected, wild-type cells grown with glucose as 
the sole carbon source lacked detectable levels of ProB and 
N-methyl proline but showed proline levels similar to those of the 
deletion strain fed D-Pro-B. Glutamate, the product of the bifunc- 
tional PutA proline dehydrogenase-l-pyrroline-5-carboxylate 
dehydrogenase, was detected in all three samples (Fig. 6E). An 
additional metabolomics experiment (not shown) gave intensity 
patterns similar to those of the D-Pro-B-grown cells. Note that 
accumulation of D-Pro-B in the mutant strain demonstrated that 
importation of D-Pro-B is not coupled to its catabolism. 

Transcriptional regulation of Pro-B catabolism. In the ge- 
nomes of P. bermudensis, P. denitrificans, and _R. sphaeroides, a 
gene encoding a LysR homologue ihpbR) is found adjacent to the 
hpbBl and hpbCl genes that encode a Rieske-type demethylase 
(Fig. 1). LysR transcription factors comprise one of the largest 
families of prokaryotic transcriptional regulators and regulate a 
very diverse range of genes (16). LysR proteins can act as either 
repressors or activators of transcription upon binding a small- 
molecule ligand (16, 17).Thegene encoding the candidate P. deni- 
trificans LysR protein (HpbR) was disrupted to see if it played a 
role in Pro-B catabolism. The resulting strain (RPd5) showed slow 
growth with l- or D-Pro-B, but normal growth with L-proline, and 
thus had a phenotype very similar to that of the strain lacking 
hpbBl and hpbCl (Table 2). This result argued that HpbR acti- 
vates transcription of the neighboring hpbBl and hpbCl Rieske- 
type protein genes and perhaps other genes of the Pro-B catabo- 
lism pathway. However, the slow growth observed in the absence 
of HpbR could be due to a high basal level of expression of the 
target genes or, as in the case of the Rieske demethylases, the pres- 
ence of another set of genes encoding a duplicate function that 
does not require HpbR activation. 

The most straightforward candidate for the small-molecule li- 
gand of HpbR would be a proline betaine. If so, HpbR was ex- 
pected to contain the aromatic cage often found in betaine bind- 
ing sites. Hence, the P. denitrificans protein (pdHpbR) was 
modeled on the structure of the Neisseria meningitidis CrgA pro- 
tein (PDB 3HHG). The CrgA protein was chosen based on BLAST 
analyses using the substrate binding domain of HbpR (residues 98 
to 291) as the query plus its crystal structure. Although another 
sequence gave a slightly better sequence match, it was an inferior 
template because its crystal structure (PDB 2QSX) contains two 
disordered segments. Although the overall amino acid sequence 
identity between pdHpbR and CrgA is relatively low (21%), the 
modeled pdHpbR ligand binding site residues Tyrl47, Trp200, 
and Trp242 form an aromatic cage capable of the cation- tt inter- 
actions used in betaine binding. Two residues. Asp 163 and 
Argl94, that could act to shape the binding site electric field are 
also present. This homology model was used for induced- fit dock- 
ing with Pro-B as the ligand (see Fig. S4 in the supplemental ma- 
terial). In the liganded model, the quaternary ammonium of the 
ligand fits well into the aromatic cage, whereas Argl94 forms salt 
bridges with the carboxylate group of the ligand and Asp 163 in- 
teracts with the positively charged quaternary ammonium group. 
The HpbR proteins of P. bermudensis and -R. sphaeroides were also 
modeled, and these proteins also formed potential aromatic cage 
structures (see Fig. S4). 

Pro-B induces increased transcription of the genes of the 
pathway. To investigate the role of Pro-B and HpbR in expression 
of the P. denitrificans and _R. sphaeroides Pro-B degradation genes, 
cultures of the wild-type strains were grown with sole carbon 
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TABLE 2 Growth phenotypes of P. denitrificans amd R. sphaeroides wild-type and mutant strains" 



Plienotype 



Genotype 


Strain Glucose Succinate 


L-Pro-B D-Pro-B 


L-Pro N-methyl-L-Pro 


Wild type 




-t- -t- -t- -!--!--}- 


-t- -t- -t- -t- -t- 


hpbD 


Krai + + ^ + 


-t- -t- -t- — 


-t- -t- -t- -t- -t- 


Snpuni rlpuLyl 


Kr Qz + + ^ 


-t- -t- -t- -t- 


-t- -t- -t- -t- -t- 


snpbBz hpbCz 


RPd3 -1- + + -1- + -1- 


-t- -t- -t- -t- 


-t--t--t- -t--t- 


hpbBl hpbCl and hpbB2 hpbC2 


RPd4 -1- + + -1- + + 




+ + + + + 


hpbR 


RPdS + + + 


+ + + + 


+ + + + + 


hpbA 


RPd6 + + + 




+ + + 


RPdl complemented with hpbD 


RPdlO + + + 


+ + + + + + 


+ + + + + 


RPd2 complemented with hpbBl and 


RPdll + + + + 


+++ +++ 


+ + + + + 


hpbCl 








RPd6 complemented with hpbA 


RPdl2 + + + + 


+++ +++ 


+ + + + + 


putA 


RPdl3 + + + 








2.4.1 + + + 


+ + + + 


+ + + + 


hpbD 


RRsl + + + + + + 


+ + + 


+ + + + + 


" + + + , growth similar to that of the wild-type 


strain (note that none of the strains grew on D-proline) 


; + + and + , slow-growth phenotypes 


, — , growth-defective phenotype. 



Assays were performed in triplicate {from five separate cultures). 



sources of glucose, L-proline, or l- or D-Pro-B in minimal me- 
dium, and quantitative reverse transcription-PCR (qRT-PCR) 
transcriptional analyses were performed. Relative to glucose- 
grown cells, transcription of the racemase gene, hpbD, and the 
genes encoding all demethylase proteins was increased 8- to 15- 
fold in cells grown on either l- or D-proline. In contrast, the ex- 
pression levels of L-proline-grown cells did not significantly differ 
from those of glucose-grown cells (Table 3). Transcription of sev- 
eral genome-proximal genes encoding putative transport proteins 
also increased in Pro-B-grown cells, but L-proline-grown cells also 
showed increased transcription. The putative transport genes had 
lower induction ratios than the catabolic enzyme genes as ex- 
pected from a high basal level of inducer transport required to 
initiate induction (Table 3). 

Pro-B induction also demonstrated hallmarks of secondary 
regulatory interactions. Expression of the gene encoding PutA, the 
enzyme that converts L-proline to glutamate, increased in Pro-B- 
grown cells, presumably due to accumulation of L-proline (cells 
grown on L-proline showed similarly increased putA transcrip- 
tion). Demethylation of Pro-B results in the synthesis of formal- 
dehyde, a toxic compound, and thus bacteria that utilize Pro-B as 
the carbon source should have a pathway to neutralize formalde- 
hyde. It has been reported that P. denitrificans can utilize methanol 
as a carbon source (18), which involves formation of formalde- 
hyde, and thus the bacterium has an efficient formaldehyde detox- 
ification pathway. The detoxification pathway is induced by form- 
aldehyde and includes a glutathione-dependent formaldehyde 
dehydrogenase (hpbBC) and an S-formylglutathione hydrolase 
{hpbL). Both genes showed a 20-fold increase in transcript levels 
in cultures grown on Pro-B, consistent with formaldehyde forma- 
tion as a product of Pro-B catabolism (Table 3). The JipbM gene, 
which encodes a GroES domain alcohol dehydrogenase with se- 
quence similarity to formaldehyde dehydrogenase, showed a 
5-fold upregulation in cells grown on Pro-B, suggesting a possible 
role in formaldehyde detoxification (Table 3). However, no in- 
crease in hpbM transcript levels was observed when cells were 
grown with methanol, suggesting that its expression is regulated 
by Pro-B but not by formaldehyde. 

Expression of the genes encoding the P. denitrificans HbpR 



LysR-type transcription factors was unaffected by growth on 
Pro-B (Table 3). However, disruption of the P. denitrificans hpbR 
gene severely decreased induction of the Pro-B catabolic genes, 
consistent with previously discussed evidence that HpbR is a key 
transcriptional activator that modulates response to Pro-B (Table 
3). 

A similar but less extensive set of qRT-PCR analyses were per- 
formed with the _R. sphaeroides wild- type strain (Table 4) . The data 
were very similar to those obtained with P. denitrificans. 

Salt stress and cold stress suppress the Pro-B catabolic path- 
way. This work has been mainly concerned with Pro-B as a carbon 
and nitrogen source. However, under osmotic stress conditions 
Pro-B catabolism must be largely suppressed to allow a high in- 
tracellular concentration of the osmolyte to accumulate in the 
bacterial cytosol. Moreover, Pro-B transport should increase to 
aid osmolyte accumulation. In accord with these predictions, in 
cultures of wild-type strains of P. denitrificans and R. sphaeroides 
grown on glucose in media containing 0.5 M NaCl, expression of 
the catabolic genes was severely repressed relative to that seen with 
cultures grown without added NaCl (Tables 3 and 4). Use of 
D-Pro-B as the carbon source in 0.5 M NaCl resulted in less repres- 
sion of the catabolic genes, although the expression levels re- 
mained well below those seen in the same medium lacking added 
NaCl. In contrast, NaCl addition did not suppress transcription of 
the genes encoding the putative transport proteins and, in the case 
of the hphX hpbY hpbZ gene cluster, salt stress gave increased 
transcription consistent with accumulation of the osmolyte. 
Therefore, salt stress suppresses expression of the catabolic path- 
way genes but not of the transport genes and thereby allows Pro-B 
to accumulate and function as an osmoprotectant. 

Although this scenario fits the terrestrial bacteria P. denitrifi- 
cans and _R. sphaeroides, the Pro-B degradation pathway was first 
identified in P. bermudensis, a marine bacterium. P. bermudensis 
was isolated from the open ocean (the Sargasso Sea) where salt 
concentration and hence osmolarity are essentially invariant. 
Hence, although P. bermudensis could take nutritional advantage 
of Pro-B produced by marine algae, the compound would likely 
not be used to counter osmotic stress. This raised the issue of the 
relevance of Pro-B accumulation for this bacterium. However, in 
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"O2CII 

L-Pro-B — 

L-Pro-B N -d em ethyl as e 
(hpbBI, hpbCI; hpbBZ, hpbCZt 



H 




Pro-B racemase 

{hpbD) 



N-Me-L -Proline 



-02Ciii\--^ 




N-Me-L -Proline N-demethylase 

ihpbA) 



L-Proline 



+O2 
-HCHO 



H 



OoCii 



*H2N 



Proline utilization A 

(putA) 



+0, 



L-Glutamate -02C''^-^---^C02 

FIG 4 Proposed pathway for L-Pro-B and D-Pro-B catabolism based on the genome neighborhood contexts in P. bermudensis, P. denkrificans, and R. 
sphaeroides. The interconversion of L-Pro-B and D-Pro-B is catalyzed by HpbD; L-Pro-B undergoes two N demethylations to L-Pro; and finally, L-Pro oxidation 
is catalyzed by a bifunctional enzyme PutA to give L-glutamate, which is readily deaminated to ammonia plus 2-ketoglutarate, the citric acid cycle intermediate. 



Bacillus subtilis and Listeria monocytogenes, betaines have been re- 
ported to play a significant role in cold tolerance (3, 4). Since 
marine bacteria experience temperature fluctuations upon depth 
changes, P. dentrificans and R. sphaeroides were tested to see if 
these bacteria respond to low temperature in a manner similar to 
the high-salt response. Indeed, in both bacteria, expression of the 
Pro-B catabolism genes in 15°C-grown cultures was significantly 
lower than in 30°C-grown cultures (Tables 2 and 4). Given these 
results, L-Pro-B supplementation would be expected to give in- 
creased growth of P. dentrificans at 15°C, but not at 30°C, and this 
was the case (Fig. 7). 

These results strongly suggest that plants and marine algae 
make the d enantiomer of Pro-B as well as the l enantiomer be- 
cause of the conservation of HpbD in plant- and alga-associated 



bacteria and its coregulation with the enzymes of L-Pro-B catabo- 
lism. However, in several literature searches, no reports in which 
the enantiomer composition of Pro-B isolated from nature has 
been assayed were found. 

All extant sequenced bacterial genomes were analyzed for the 
presence of genes encoding putative Pro-B metabolism proteins 
having more than 50% sequence similarity with the respective 
proteins of P. denitrificans. This gave a list of about 100 bacterial 
species (see Table S4 in the supplemental material). Interestingly, 
47% of the candidates are plant-associated bacterial species, 20% 
are marine bacteria that share a microenvironment with marine 
macroalgae, and another 8% are soil bacteria known to live in 
close proximity with plants. 

In the present study, the Pro-B catabolic pathway of P. denitri- 
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tool for identification and subsequent ex- 
perimental verification of novel pathways 
in diverse bacteria. 



Wild Type 
+ D-Pro-B 



hpbD + L-Pro-B 



hpbB1-hpbC1 
hpbB2-hpbC2 
+ N-Me-Pro 



hpbBI-hpbCI 
hpbB2-hpbC2 
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FIG 5 Growth phenotypes. (A) Growth of the P. denitrificans wild-type strain {Pdl222) or mutant 
strains (RPdl and RPdlO) in minimal medium with L- or D-Pro-B. The hpbD strain was RPdl, and the 
complemented strain was RPdlO. (B) Growth of the P. denitrificans wild-type stain (Pdl222) or mutant 
strains (RPd4 and RPdll) in minimal medium with D-Pro-B or N-Me-Pro. The strain missing both 
Rieske demethylases (genes hpbBl and hpbCl and genes hpbB2 and hpbCZ) was RPd4, and the comple- 
mented strain was RPdll. (C) Growth on N-Me-Pro as the carbon source. The P. denitrificans hpbA 
strain was RPd6, and the complemented strain was RPdl 2. 



ficans and -R. sphaeroides has been demonstrated. We propose that 
pathway docking in combination with computational biology, en- 
zymology, metabolomics, and microbiology provides a powerful 



MATERIALS AND METHODS 

Homology modeling and docking. The mod- 
els of pbHpbD, rsHpbD, and pdHpbR were 
built using our in-house software Protein Lo- 
cal Optimization (PLOP; marketed as Prime 
by Schrodinger LLC) v 25.0. The PDB tem- 
plate used for the two HpbD enzymes is 2PMQ 
and for pdHpbR is 3HHG. The sequence 
alignment of each pair of target and template 
was made using the L-INS-i method in 
MAFFT V 7.012b (19). Proline betaine was 
docked to the three models, and its three- 
dimensional (3D) conformation was prepared 
by LigPrep (20). Two docking methods were 
used. The Glide XP (21) docking method was 
used on two HpbD models; the details have 
been described previously. Induced-fit dock- 
ing (22) was used on prfHpbR, which allowed 
side chains within 10 A of the ligand to be flex- 
ible, and Glide XP was used for redocking. 

Cloning and expression of P. denitrifi- 
cans hpbD. The P. denitrificans hpbD gene was 
amplified from the genomic DNA using prim- 
ers PD1187F and PD1187R (see Table S2 in 
the supplemental material). PGR was per- 
formed using KOD Hot Start DNA polymer- 
ase (Novagen). The conditions were 2 min at 
gSX, followed by 40 cycles of 30 s at 95°C, 30 s 
at 66°C, and 30 s at 72°C. The amplified frag- 
ments were cloned into an N-terminal tobacco 
etch virus (TEV)-cleavable StrepII-6XHis 
tag- containing vector, pSGC, by ligation- 
independent cloning (23). All growth media 
contained 100 p.g/ml kanamycin sulfate and 
35 /ig/ml chloramphenicol. Plasmid was 
transformed into Escherichia coli 
Rosetta2(DE3) pLysS (Novagen) and plated 
on Luria-Bertani (LB) agar plates. Five to 10 
colonies were used to inoculate 100 ml of LB 
media and allowed to grow overnight at 37°C 
(250-rpm agitation). The overnight cultures 
were used to inoculate 4 liters of autoinduc- 
tion media in 10 2-liter flasks and incubated 
(250-rpm agitation) at 25°C for 24 h, reaching 
an optical density at 600 nm (ODjoq) of 20 to 
25 (24, 25). The cells were harvested by cen- 
trifugation and stored at — 80°C. 

Purification of pdHpbD. All puriflcation 
steps were performed at 4°C. Cells were resus- 
pended (4:1 [vol/wt]) in buffer A {50 mM 
2- [4-(2-hydroxyethyl)piperazine- 1 -yl] eth- 
anesulfonic acid [pH7.8], 150 mMNaCl, 10% 
glycerol, 20 mM imidazole} with 0.1% Tween 
20 and then disrupted by sonication. The cel- 
lular extract was clarified by centrifugation at 
4°C and appUed to a 10-ml nickel (Ni)- 
Sepharose HP (GE Healthcare) column. The 
column was washed with 10 column volumes 
of buffer A, and proteins were step eluted with 300 mM imidazole- buffer 
A. TEV protease (1/50 [wt/wt]) was added to the pooled protein- 
containing fractions, and the reaction was dialyzed overnight against 
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TABLE 3 Transcriptional analysis by qRT-PCR of wild-type P. denitrificans (strain PD1222) and P. denitrificans htpR strain RPd5° 



Fold change in gene expression 



Locus tag 



Glucose (0.5 M D-Pro-B (0.5 M 



Dtrain ana gene 


(Fden } 


L-Pro 


L-Pro-B 


D-Fro-b 


NaCl) 


NaCl) 


Methanol 


15°C 


P. denitrificans wild type 


















Racemase 


















hpbD 


1187 


1.1 ± 0.2* 


9 ± 1.5 


11 ± 1.5 


-7 ± 1.0 


3 ± 0.5 


— 


-2.5 ± 0.5 


Demethylases 


















hpbCl 


1188 


1.1 ± 0.2* 


14 ± 2.0 


13 ± 1.5 


-8 ± 1.0 


2.1 ± 0.5 


— 


-2.3 ± 0.3 


hpbBl 


1189 


1.5 ± 0.1* 


15 ± 0.9 


13 ± 0.5 


-5 ± 0.4 


2.8 ± 0.4 


— 


-2.7 ± 0.3 


hpbC2 


2831 


1.1 ± 0.3* 


7.7 ± 0.5 


9.2 ± 0.5 


-7 ± 0.3 


2.7 ± 0.4 


— 


3.1 ± 0.5 


hpbBl 


2832 


1.3 ± 0.2* 


7.8 ± 0.6 


10 ± 0.7 


-6 ± 0.5 


2.2 ± 0.3 


— 


-3.6 ± 0.4 


hpbA 


1197 


1.5 ± 0.2 


12 ± 1.1 


12 ± 0.8 


-5 ± 0.7 


2.7 ± 0.2 


— 


-2.7 ± 0.5 


Transcription factor 


















hpbR 


1190 


1.5 ± 0.1* 


1.9 ± 0.1* 


1.1 ± 0.1* 


1.1 ± 0.1* 


1.0 ± 0.2* 


— 


— 


Putative transporters 


















hpbX 


1193 


2.9 ± 0.2 


3.9 ± 0.3 


3.5 ± 0.7 


8.5 ± 1.0 


4.5 ± 0.2 


— 


— 


hpbY 


1194 


2.2 ± 0.2 


3.5 ± 0.5 


4.2 ± 0.4 


9.5 ± 0.6 


5.5 ± 0.5 


— 


— 


npu/L 


1 195 


2.1 it 0.1 


2.5 :t 0.4 


4.4 it 0.4 


10.5 it 0.5 


4.5 it 0.3 






hpbj 


4749 


2.5 ± 0.3 


6.5 ± 1.6 


7.2 ± 1.5 


5.2 ± 0.3 


10 ± 0.6 


— 


— 


hpbH 


4748 


2.4 ± 0.3 


7.1 ± 1.5 


6.9 ± 1.3 


4.3 ± 0.5 


14 ± 1.2 


— 


— 


hpbO 


4747 


2.3 ± 0.3 


5.5 ± 0.7 


6.5 ± 0.5 


4.9 ± 0.4 


15.5 ± 0.9 


— 


— 


hpbN 


4746 


2.5 ± 0.2 


5.5 ± 1.0 


7.5 ± 1.4 


4.2 ± 0.5 


12.5 ± 1.3 


— 


— 


Proline cataboUsm 


















putA 


94 


4.2 ± 0.4 


3.1 ± 0.5 


3.5 ± 0.5 


1.0 ± 0.1* 


2.5 ± 0.3 


— 


— 


hpbM 


1186 


1.5 ± 0.2* 


4.9 ± 0.5 


4.5 ± 0.4 


-1.1 ± 0.3* 


2.5 ± 0.2* 


4.5 ± 0.5 


— 


Formaldehyde detoxification 


















hpbK 


15 


1.1 ± 0.1* 


22 ± 2.5 


19.5 ± 3 


1.1 ± 0.1* 


6.0 ± 1.0 


25 ± 3 




hpbL 


16 


1.3 ± 0.2* 


21 ± 3.5 


19.8 ± 2 


-1.5 ± 0.1* 


7.5 ± 2.0 


28 ± 2 




P. denitrificans hpbR 


















Racemase 


















hpbD 


1187 


1.1 ± 0.2* 


3 ± 0.5 


5.0 ± 1.1 


-7.5 ± 1.0 


3.5 ± 0.5 






Demethylases 


















hpbCl 


1188 


1.2 ± 0.2* 


3.4 ± 2.0 


2.3 ± 0.5 


-8.5 ± 1.0 


2.5 ± 0.5 






hpbBl 


1189 


1.4 ± 0.1* 


2.2 ± 0.9 


2.2 ± 0.3 


-6.7 ± 0.4 


2.4 ± 0.3 






hpbC2 


2831 


1.1 ± 0.2* 


2.1 ± 0.5 


2.2 ± 0.5 


-7.1 ± 0.5 


3.7 ± 0.2 






hpbB2 


2832 


1.4 ± 0.1* 


2.3 ± 0.6 


2.1 ± 0.5 


-6.8 ± 0.5 


3.2 ± 0.3 







" The strains were cultured in minimal medium containing 20 mM glucose or succinate as the sole carbon source, and gene expression was compared to that of the strain grown in 
minimal media on different carbon sources (L-Pro-B, D-pro-B, L-proline, methanol, or glucose supplemented with 0.5 M NaCl) at either 30°C or 15°C. The identity of each gene is 
indicated by the genome locus name. Assays were performed in triplicate (four biological replicates), and the data are averages and standard deviations of the results. The average 
ratio is the fold difference compared to cells grown with glucose as a carbon source. P value < 0.005. The qRT-PCR analyses were done as described in Materials and Methods. *, 
not significant; — , not determined. 



buffer A with 300 miVI imidazole. The pooled eluate was then concen- 
trated to 15 to 30 mg/ml and applied to a 16/60 Superdex 200 column (GE 
Healthcare) equilibrated against buffer B {10 mM 2-[4-(2- 
hydroxyethyl)piperazine-l-yl] ethanesulfonic acid (pH 7.8), 150 mM 
NaCl, 5% glycerol, 1 mM EDTA, 1 mM dithiothreitol (DTT)}. Fractions 
with >95% purity by SDS-PAGE were pooled and concentrated to 15 to 
30 mg/ml by centrifugal ultrafiltration. Aliquots were snap-frozen in liq- 
uid nitrogen and stored at — 80° C 

Crystallization and structure solution of pdHpbD. Crystals were ob- 
tained by vapor diffusion at 1 8°C using the sitting drop vapor diffusion 
method in 96-well IntelliPlates (Art Robbins). Equal volumes of protein 
(27.5 mg/ml in 10 mM HEPES [pH 7.5], 150 mM NaCl, 5% [wt/vol] 
glycerol, 1 mM EDTA, 1 mM DTT) and crystallization buffer (30% [vol/ 
vol] polyethylene glycol 400 [PEG 400], 0.1 M MES [morpholineethane- 
sulfonic acid] [pH 6.5], 200 mM MgClj) were combined and equilibrated 
against 70 /j,l of crystallization buffer in the reservoir. Crystals were soaked 
for 5 min in the reservoir solution for unliganded structures or in the 
reservoir solution supplemented with 200 mM L-proline betaine or 200 
mM tHyp-B (trflns-4-hydroxy-L-proline betaine) for the complexes. 
Crystals were flash cooled by immersion in liquid nitrogen and subse- 



quently stored and shipped to Advanced Photon Source beamline 31 -ID 
(Lilly-CAT). Data were collected at 100 K and a wavelength of 0.97929 A. 
Data were processed using MOSFLM (26) and scaled in space group P42[2 
using SCALA (27). A monomer from a related proline betaine racemase 
(J. B. Bonanno, M. Rutter, K. T. Bain, C. Lau, V. Sridhar, D. Smith, S. 
Wasserman, J. M. Sauder, S. K. Burley, S. C. Almo, New York SGX Re- 
search Center for Structural Genomics [NYSGXRC]; PDB 2PMQ) was 
used as a search model in molecular replacement. PHASER (28), within 
the refinement package PHENIX (29), located two subunits which by 
crystaUographic symmetry generated the expected molecular octamer. 
Several rounds of manual rebuilding and ligand and water fitting within 
the molecular graphics program COOT (30) followed by refinement in 
PHENIX were performed to finalize the structure. Iodine atoms, originat- 
ing from the preparations of Pro-B and tHyp-B, were modeled into dif- 
ference density peaks with features suggestive of bound iodine. The ge- 
ometry restraints for the ligands were produced using ELBOW within 
PHENIX. L-Pro-B (substrate) and c!s-40H-D-proline betaine (cHyp-B 
[product]) were chosen as the modeled ligands in the respective com- 
plexes based on the fit to the electron density; however, the liganded 
structures are presumed to be a racemic mixture due to turnover in the 
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Pro-B N-Me-Pro Proline Glutamate 



FIG 6 Accumulation of D-proline betaine catabolic metabolites measured by LC-FTMS. (A to D) Extracted ion chromatograms of representative LC-FTMS 
runs of extracts of the P. denitrificans hpbD, hpbBl, and hpbCl and hpbB2 and hpbCZ strain, RPdl4, fed L-Pro-B for 1 h (black curves), wild-type P. denitrificans 
PD1222 grown on D-glucose (red curves), and the wild-type strain grown on L- Pro-B (blue curves) and standards (purple curves). The cells were extracted with 
10 mM ammonium bicarbonate (pH 9.2)-buffered acetonitrile (iVIaterials and Methods). Panels A to D correspond to the intensities (5-ppm mass accuracy) for 
proline betaine 144.10191 Da), N-Me-Pro (M+; 130.08626 Da), proline (M+; 116.07060 Da), and glutamate (M"; 148.06043 Da), respectively. (E) 

Averaged integrated intensities of the four peaks for the two strains, with conservation of the color coding of panels A to D. Standard errors are shown for four 
biological replicates (three for strain RPdl4); each biological replicate value was the average of the results of three technical replicates. iVIore details are given in 
>Fig. S5 in the supplemental material. 



crystal. Crystallographic refinement statistics are shown in Table S 1 in the 
supplemental material. 

LC-FTMS metabolomics. Metabolomics of metabolite extracts was 
performed using a custom IIT LTQ-FT mass spectrometer (Thermo 
Fisher Scientific) with an Agilent 1200 high-pressure LC (HPLC) system 
equipped with a Sequant Zic-HILIC column (2.1 mm by 150 mm), along 
with an extraction procedure similar to that used in the previous analysis 
of frafis-4-hydroxy-L-proline betaine catabolism by P. denitrificans (13, 
31). Preparation of the biological samples required growth of four inde- 
pendent colonies of P. denitrificans in minimal medium with glucose as 
the carbon source before subculturing each in duplicate into minimal 
media with either glucose or D-proline betaine as the sole carbon source. 



These eight cultures were grown at 30°C with aeration to an ODgoo of ca. 
0.6 before being pelleted in a refrigerated centrifuge (4,000 X gfor 8 min), 
resuspended in 1 vol of cold minimal medium lacking carbon, pelleted 
again, and resuspended in a small volume of cold minimal medium lack- 
ing carbon, measuring the OD^m, again by dilution, and aliquoting nor- 
malized amounts of cells in triplicate for each of the eight cultures equal to 
1 ml of an ODggo of 6. The cell aliquots were pelleted in 1.5-ml centrifuge 
tubes; the supernatant was discarded; and the cell pellets were stored at 
— 80°C for <2 weeks prior to analysis. 

Preparation of the hpbD, hpbBl and hpbCl, and hpbB2 and hpbC2 
genetic disruption strain, RPdl4, required growth in minimal medium 
with glucose as the sole carbon source, as this strain was expectedly unable 
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TABLE 4 Transcriptional analysis of wild-type R. sphaeroides 2.4.1" 



Fold change in gene expression 



Product category and 
gene 


Locus tag 
(RSP) 


L-Pro 


L-Pro-B 


D-Pro-B 


Glucose (0.5 M 
NaCl) 


D-Pro-B (0.5 M 
NaCl) 


Methanol 15°C 


Racemase 
















hpbD 


4007 


1.1 ± 0.2* 


9 ± 1.5 


11 ± 1.5 


-7 ± 1.0 


3 ± 0.5 


— —2.0 ± 0.3 


Demethylases 
















hpbCl 


3979 


1.0 ± 0.2* 


11 ± 1.5 


9 ± 1.6 


-8 ± 1.0 


2.1 ± 0.5 


— -4.5 ± 0.3 


hpbBl 


3978 


1.5 ± 0.4* 


10 ± 1.0 


11 ± 1.5 


-5 ± 0.4 


2.8 ± 0.4 


— -4.7 ± 0.5 


hpbA 


3861 


1.4 ± 0.2* 


10.5 ± 1.2 


10 ± 1.0 


— 5 ± 0.7 


2.7 ± 0.2 


— —5.5 ± 0.5 


Transcription factor 
















hpbR 


3977 


1.1 ± 0.1* 


1.9 ± 0.2* 


1.9 ± 0.1* 


1.4 ± 0.1* 


1.3 ± 0.1* 




Putative transporters 
















hpbj 


3998 


1.5 ± 0.3 


8 ± 1.6 


9 ± 1.5 


5.5 ± 0.3 


6 ± 0.6 




hpbl 


3999 


2.0 ± 0.5 


7.5 ± 1.5 


8 ± 1.0 


4.4 ± 0.3 


7 ± 1.2 




hpbH 


4000 


1.7 ± 0.7 


7.5 ± 0.7 


7.5 ± 0.5 


3.3 ± 0.5 


8.5 ± 0.9 




Proline catabolism 
















putA 


2166 


3.9 ± 0.4 


4.1 ± 0.5 


2.5 ± 0.5 


-1.1 ± 0.1* 


2.1 ± 0.3 




Formaldehyde 
















detoxification 
















hpbK 


2575 


LI ± 0.2* 


11 ± 2 


9 ± 1 


-1.1 ± 0.1* 


7.0 ± 1.0 


25 ± 3 


hpbL 


2576 


1.2 ± 0.3* 


11 ± 3 


10 ± 2 


-1.1 ± 0.1* 


7.5 ± 1.0 


23 ± 3 



" The analyses were conducted as described for Table 3. *, not significant; — , not determined. 



to grow with L- or D-Pro-B as the sole carbon source. RPdl4 cells grown in 
triplicate on glucose were washed in cold minimal medium lacking carbon 
as described above before being resuspended in 1 vol of minimal medium 
with D-Pro-B as the sole carbon source. These cells were incubated at 30°C 
for 1 h before being prepared as described above for normalized aliquot- 
ing (1 ml of an ODgoo of 6), pelleting, and storage at —80° C. For liquid 
chromatography- Fourier transform mass spectrometry-based metabolo- 
mics analysis, the cell pellets were each extracted with 0.375 ml of 10 mM 
ammonium bicarbonate (pFi 9.2)-90% acetonitrile, buffer B, containing 
1 /Ag/ml internal standard (acetaminophen) at room temperature. Extrac- 
tion consisted of the cells being suspended in buffer B by vigorous pi- 
petting before vortex mixing for 15 min and three rounds of centrifuga- 
tion (16,000 X g for 5 min) and decanting the supernatant into fresh 
containers, all at room temperature. Each extracted sample was injected 
three times at 100 fjl into the Zic-HILIC column preequilibrated with 
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FIG 7 Effects of L-Pro-B addition on growth of wUd-type P. denitrificans in 
minimal media at either 15°C or 30°C. Glucose (20 mM) was present in all 
cultures, whereas L-Pro-B (20 mM) was present as shown. 



100% buffer B. Buffer A consisted of water buffered by 10 mM ammo- 
nium bicarbonate (pFi 9.2). Metabolites were eluted from the column 
onto the electrospray ion source at a 200 /il/min flow rate via the following 
HPLC protocol: 2 min at 100% B, a linear gradient from 100% B to 72% B 
over 10.5 min, a linear gradient from 72% B to 60% B over 14.5 min, 
another linear gradient from 60% to 100% B over 3 min, and a 15-min 
reequUibration at 100% B. Standards (o-Pro-B, L-proline, and 
L-glutamate) were diluted into buffer B before precisely following the 
LC-FTMS method described above. All data were collected with polarity 
switching, the profile mode at 50,000 resolution, and the full scan set to 
m/z 79 to 1,000. Data analysis was performed using the Qualbrowser ap- 
plication of the Xcalibur suite from Thermo Fisher and a 5-ppm mass 
accuracy cutoff. Integrated intensities from each technical replicate were 
normalized using the integration of the internal standard (retention time 
[RT] = 3.5 min). 

Bacterial strains and growth conditions. P. denitrificans and R. spha- 
eroides wild-type and mutant strains were grown in minimal medium 
containing (per liter) 6.0 g of K2HPO4, 4.0 g of KH2PO4, 0.15 g of sodium 
molybdate, 0.2 g of MgS04-7H2O, 0.04 g of CaClj, 0.001 g of 
MnS04-2H20, and 1.1 g of FeS04-7H20 with and without 1.6 g of NH4CI 
as the nitrogen source. Cultures were grown aerobicaUy at 30°C or 15°C in 
mineral medium supplemented with glucose or succinate, L- or D-Pro-B, 
or methanol, all at 20 mM. E. coli strain TOPIO (Invitrogen) was used for 
plasmid maintenance, propagation, and cloning purposes. £. coli strain 
S17-1 was used for conjugation (32). Strains used are listed in Table S3 in 
the supplemental material. To study the role of l- or D-Pro-B in osmopro- 
tection, cultures were grown in minimal medium with glucose plus or 
minus 0.5 M NaCl. E. coli cultures were grown at 37°C in LB (Luria- 
Bertani media). Antibiotics were used at the following concentrations (in 
fig ml^i): kanamycin sulfate, 50; chloramphenicol, 35; sodium ampicU- 
lin, 100. 

Construction of gene disruption mutant strains. Gene disruption 
strains were generated in P. denitrificans and R. sphaeroides by conjugation 
(33) or electroporation (34). 

Molecular biology protocols. Chromosomal DNA was isolated from 
3 to 5 ml of P. denitrificans or R. sphaeroides cells using a DNeasy blood & 
tissue kit (Qiagen) or a Wizard genomic DNA purification kit (Promega), 
respectively. Restriction enzymes, DNA polymerases, and T4 DNA ligases 
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were purchased from New England Biolabs, Fermentas, Invitrogen, or 
Promega. Plasmids were prepared from E. coli TOP 10 cells using a Plas- 
mid Minikit (Qiagen). Standard protocols were followed according to 
user's manuals provided or following methods described in reference 35. 

Plasmid constructions for gene disruption, complementation, and 
protein overexpression. Construction of plasmids (see Table S3 in the 
supplemental material) for gene knockout in P. denitrificans was done as 
previously described (13). The putA orthologue was amplified from P. 
denitrificans genomic DNA using Pfu polymerase plus primers PDputAF 
and PDputAR (see Table S4) followed by insertion of the PGR products 
into pGEM T Easy vector (Promega). The resulting plasmids were then 
digested with Nrul and ligated to a 900-bp fragment blunted-end chlor- 
amphenicol resistance (cat) cassette. This plasmid was then used as a PGR 
template with the same primers, and the products were ligated to vector 
pSUP202, which had been digested with EcoRI and treated with the Kle- 
now fragment of DNA polymerase I plus the four deoxynucleoside 
triphosphates (dNTPs) to give the pRK14 plasmid used for gene disrup- 
tion. The R. sphaeroides hpbD gene was amplified from genomic DNA 
using Pfti polymerase plus primers RSP4007F and RSP4007R (see Table 
S4) followed by insertion of the PGR products into pGEM T Easy vector 
(Promega). The resulting plasmids were digested with EcoRV and ligated 
to a 900-bp fragment blunted-end chloramphenicol resistance (cat) cas- 
sette. This plasmid was then used as a PGR template with the same prim- 
ers, and the products were ligated to vector pSUP202 digested with EcoRI 
and treated with the Klenow fragment of DNA polymerase I plus the four 
dNTPs to give the pRRsl plasmid used to generate the gene disruption of 
strain RRsl. 

Expression plasmids and cloning, expression, and purification of 

rsHpbD. The PGR amplification was performed by using primers 
RSP4007F and RSP4007R and R. sphaeroides genomic DNA. The ampli- 
fied product was digested with Ndel and BamHI (New England Biolabs) 
and then inserted into pET17b (Novagen). The recombinant plasmids 
were transformed into E. coli BL21 (DE3) cells for protein expression. 
Transformed cells were grown at 37°G in 4 liters of LB medium and were 
harvested after 24 h by centrifugation. The cells were resuspended in 100 
ml of buffer containing 20 mM Tris-HGl (pH 8.0) and 5 mM MgGlj. The 
suspension was lysed by sonication, and debris was cleared by centrifuga- 
tion. The supernatant was applied to a DEAE Sepharose FF column and 
eluted with a linear gradient ( 1 ,600 ml) of 0 to 1 M NaGl buffered with 20 
mM Tris-HGl (pH 8.0) containing 5 mM MgGlj. Fractions containing the 
protein of interest were pooled and applied to a phenyl Sepharose 6 Fast 
Flow column in 1 M (NH4)2S04 for further purification. The protein was 
eluted with a linear gradient, 800 ml of 1 toOM (NH4)2S04, andthe purest 
fractions were pooled and dialyzed against 20 mM Tris-HGl (pH 8.0) 
containing 5 mM MgGlj. 

Mass spectroscopy screening assay for rsHpbD on proline deriva- 
tives. Screening assays were set up with the following proline derivatives: 
L-Pro, D-Pro, N-methyl-L-Pro, L-Pro-B, 4-tra«s-(OH)-L-Pro, 4-as-(0H)- 
D-Pro, N-methyl-4-ds-(OH)-L-Pro, L-Hyp-B-N-acetyl-4-fran5-(OH)-L- 
Pro, glycyl-4-trans-(OH)-L-Pro, 3-tra«s-(OH)-L-Pro, 3-ds-(OH)-L-Pro, 
pyroglutamate, and pipecolate. Assays were performed in 200 fxl DjO 
containing 20 mM NH4HGO3, 0. 1 mM substrate, and 1 fjM enzyme. The 
reaction mixture was incubated at 30°G for 16 h and analyzed for incor- 
poration of solvent deuterium as indicated by a + 1 mass shift by electro- 
spray ionization (ESI)-MS. 

NMR assay for rsHpbD activity on Pro-B and Hyp-B. 'H nuclear 
magnetic resonance (NMR) spectra were recorded using a Varian Unity 
Inova 500-MHz NMR spectrometer. ^H NMR data of racemization and 
epimerization were recorded in a DjO-containing buffer at 30°G after 16 
h of incubation to allow deuterium incorporation. A typical reaction mix- 
ture contained 10 mM substrate (l- or D-Pro-B and tHyp-B or cHyp-B), 
25 mM sodium phosphate buffer (pD 8.0), and 1 jiiM enzyme. 

Polarimetric assay for betaine racemase and epimerase activity. Be- 
taine racemase and epimerase activity was measured at 25°G by quantitat- 



ing the change of optical rotation. The assay was performed in a 50-mm- 
path-length cell in a total volume of 0.8 ml using a Jasco P-1010 
polarimeter with an Hg 405-nm-pore-size filter. Buffer conditions for the 
assay were 50 mM Tris-HGl (pH 8.0) and 10 mM MgGl2. 

Syntheses of betaines. Betaines were synthesized by a slight modifica- 
tion of the published procedure (12). A typical reaction mixture for the 
synthesis of L-Pro-B, D-Pro-B, tHyp-B, and cHyp-B was as follows: 
L-proline (4.951 g, 0.043 mol) and NaOH (5.160 g, 0.129 mol) were dis- 
solved in 50 ml of dry methanol. Methyl iodide (8.0 ml, 0.129 mol) was 
added to the solution, and the reaction mixture was refluxed with stirring 
for 6 h. An additional 3 ml of methyl iodide was added to the reaction 
mixture, and the reaction was allowed to proceed another 10 h. The sol- 
vent was removed by the use of a high-vacuum rotary evaporator over a 
40°G bath. The light yellow solid was washed with ~2 liters of acetone and 
dried by lyophilization. The product yield was >95%, and each product 
was characterized by 'H NMR (500 MHz; DjO). The purity of the prod- 
ucts was determined by optical activity measured by polarimetry. 

Transcriptomics cell preparation. P. denitrificans and R. sphaeroides 
wild-type or mutant cultures were grown in 10 ml of minimal media with 
20 mM glucose as the carbon source to an ODgQQ of 0.4. The cells were 
pelleted down by centrifugation (4,000 X g, 10 min, 4°G). Gell pellet was 
washed twice and resuspended in 10 ml of minimal media lacking a car- 
bon source. The cultures were divided into two 5-ml samples. Glucose or 
succinate was added to one sample and L-proline, L- or D-Pro-B, or meth- 
anol was added to the second sample as the sole carbon source followed by 
aerobic growth at 30°G for 1 h prior to cell harvest. The effects of salt stress 
on gene expression were tested by addition of 0.5 M NaGl (final concen- 
tration) to the minimal media. 

RNA sample preparation. For preparation of RNA samples, 1 to 5 ml 
of an actively growing P. denitrificans and R. sphaeroides culture (OD^qq of 
0.5 to 0.6) was added to 2 volumes of RNAprotect Bacteria Reagent (Qia- 
gen). After vortex mixing for 10 s, the solution was incubated for 5 min at 
room temperature. The cells were pelleted by centrifugation (10,000 X g 
for 5 min at 4°G), the supernatant was decanted, and the remaining liquid 
was removed. RNA isolation was performed on ice using an RNeasy Mini- 
kit (Qiagen), following the protocols for bacteria given by Qiagen. RNA 
concentrations were determined by absorption at 260 nm, with 1 absor- 
bance unit corresponding to 44 /j,g ml^ ' RNA. Isolated RNA was analyzed 
by agarose gel electrophoresis and spectrophotometricaUy in a Nanodrop 
instrument (Thermo Scientific) using the ratios Ajgo/Ajgo and ^2^0^230 
and the Ajj^Mjj^ absorption spectra to assess sample integrity and purity. 
RNA preparation purity and absence of DNA were validated by agarose 
gel electrophoresis and control PGR reactions. The RNA preparations 
were stored at — 80°G until use. 

qRT-PCR. Reverse transcription (RT) was performed on 1 /iig of total 
RNA by using a RevertAid H Minus First Strand cDNA synthesis kit (Fer- 
mentas) per the manufacturer's protocol. Of the cDNA, 1 fjl was used in 
separate PGR reactions of 20 fA for each gene. Minus-RT controls were 
performed to test for genomic DNA contamination in each RNA sample. 
Primers were designed using a Universal Pro-Be library system (Roche 
Applied Science). The primer lengths were 21 to 27 nucleotides and had a 
theoretical melting temperature (r„,) of 58 to 60°G. The amplicon size 
ranged from 66 to 1 10 bp. Real-time PGR was carried out in 96-well plates 
using a Roche LightGycler 480 instrument. The lO-fjl PGR mix was pre- 
pared by adding to 1 fjl of cDNA template 2 fjl of forward and reverse 
primers (final concentration, 150 nM each primer) and 10 /xl of SYBR 
2 X -concentration green Master Mix (Roche). The PGR running condi- 
tions were 1 cycle at 95''G for 5 min, 40 cycles of amplification at 95°G for 
15 s, and 60°G for 1 min. Subsequently, a dissociation program was ap- 
plied with one cycle at 95°G for 1 5 s, 60°G for 1 min, and 95°G for 1 5 s. The 
efficiency of all the primers used in qRT-PGR was calculated as 97% ± 
2%. The gene expression data were expressed as cross-point (Gp) values. 
The 16S rRNA gene was used as a reference gene. The data were analyzed 
by the 2^'^'^'-'^ (Livak) method (36). The data represent averages of the 
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results of five biological replicates. Primer sequences will be provided 
upon request. 

Nucleotide sequence and atomic coordinate and structure factor ac- 
cession numbers. The sequences described were submitted to GenBank 
under the accession numbers given in Table S2 in the supplemental ma- 
terial. The atomic coordinates and structure factors for unliganded pdH- 
pBD and the complex of pdHpBD with L-Pro-B and cHyp-B have been 
deposited in the Protein Data Bank (PDB codes 4E8G, 4J10, and 4IZG, 
respectively; http://www.pdb.org). 
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